15 research outputs found

    A novel method for inference of chemical compounds of cycle index two with desired properties based on artificial neural networks and integer programming

    Get PDF
    Inference of chemical compounds with desired properties is important for drug design, chemo-informatics, and bioinformatics, to which various algorithmic and machine learning techniques have been applied. Recently, a novel method has been proposed for this inference problem using both artificial neural networks (ANN) and mixed integer linear programming (MILP). This method consists of the training phase and the inverse prediction phase. In the training phase, an ANN is trained so that the output of the ANN takes a value nearly equal to a given chemical property for each sample. In the inverse prediction phase, a chemical structure is inferred using MILP and enumeration so that the structure can have a desired output value for the trained ANN. However, the framework has been applied only to the case of acyclic and monocyclic chemical compounds so far. In this paper, we significantly extend the framework and present a new method for the inference problem for rank-2 chemical compounds (chemical graphs with cycle index 2). The results of computational experiments using such chemical properties as octanol/water partition coefficient, melting point, and boiling point suggest that the proposed method is much more useful than the previous method

    A new approach to the design of acyclic chemical compounds using skeleton trees and integer linear programming

    Get PDF
    Intelligent systems are applied in a wide range of areas, and computer-aided drug design is a highly important one. One major approach to drug design is the inverse QSAR/QSPR (quantitative structure-activity and structure-property relationship), for which a method that uses both artificial neural networks (ANN) and mixed integer linear programming (MILP) has been proposed recently. This method consists of two phases: a forward prediction phase, and an inverse, inference phase. In the prediction phase, a feature function f over chemical compounds is defined, whereby a chemical compound G is represented as a vector f(G) of descriptors. Following, for a given chemical property π, using a dataset of chemical compounds with known values for property π, a regressive prediction function ψ is computed by an ANN. It is desired that ψ(f(G)) takes a value that is close to the true value of property π for the compound G for many of the compounds in the dataset. In the inference phase, one starts with a target value y∗ of the chemical property π, and then a chemical structure G∗ such that ψ(f(G∗)) is within a certain tolerance level of y∗ is constructed from the solution to a specially formulated MILP. This method has been used for the case of inferring acyclic chemical compounds. With this paper, we propose a new concept on acyclic chemical graphs, called a skeleton tree, and based on it develop a new MILP formulation for inferring acyclic chemical compounds. Our computational experiments indicate that our newly proposed method significantly outperforms the existing method when the diameter of graphs is up to 8. In a particular example where we inferred acyclic chemical compounds with 38 non-hydrogen atoms from the set {C, O, S} times faster

    A novel method for inference of acyclic chemical compounds with bounded branch-height based on artificial neural networks and integer programming

    Get PDF
    Analysis of chemical graphs is becoming a major research topic in computational molecular biology due to its potential applications to drug design. One of the major approaches in such a study is inverse quantitative structure activity/property relationship (inverse QSAR/QSPR) analysis, which is to infer chemical structures from given chemical activities/properties. Recently, a novel two-phase framework has been proposed for inverse QSAR/QSPR, where in the first phase an artificial neural network (ANN) is used to construct a prediction function. In the second phase, a mixed integer linear program (MILP) formulated on the trained ANN and a graph search algorithm are used to infer desired chemical structures. The framework has been applied to the case of chemical compounds with cycle index up to 2 so far. The computational results conducted on instances with n non-hydrogen atoms show that a feature vector can be inferred by solving an MILP for up to n=40, whereas graphs can be enumerated for up to n=15. When applied to the case of chemical acyclic graphs, the maximum computable diameter of a chemical structure was up to 8. In this paper, we introduce a new characterization of graph structure, called “branch-height” based on which a new MILP formulation and a new graph search algorithm are designed for chemical acyclic graphs. The results of computational experiments using such chemical properties as octanol/water partition coefficient, boiling point and heat of combustion suggest that the proposed method can infer chemical acyclic graphs with around n=50 and diameter 30

    A Novel Method for Inference of Acyclic Chemical Compounds with Bounded Branch-height Based on Artificial Neural Networks and Integer Programming

    Get PDF
    Analysis of chemical graphs is a major research topic in computational molecular biology due to its potential applications to drug design. One approach is inverse quantitative structure activity/property relationship (inverse QSAR/QSPR) analysis, which is to infer chemical structures from given chemical activities/properties. Recently, a framework has been proposed for inverse QSAR/QSPR using artificial neural networks (ANN) and mixed integer linear programming (MILP). This method consists of a prediction phase and an inverse prediction phase. In the first phase, a feature vector f(G)f(G) of a chemical graph GG is introduced and a prediction function ψ\psi on a chemical property π\pi is constructed with an ANN. In the second phase, given a target value yy^* of property π\pi, a feature vector xx^* is inferred by solving an MILP formulated from the trained ANN so that ψ(x)\psi(x^*) is close to yy^* and then a set of chemical structures GG^* such that f(G)=xf(G^*)= x^* is enumerated by a graph search algorithm. The framework has been applied to the case of chemical compounds with cycle index up to 2. The computational results conducted on instances with nn non-hydrogen atoms show that a feature vector xx^* can be inferred for up to around n=40n=40 whereas graphs GG^* can be enumerated for up to n=15n=15. When applied to the case of chemical acyclic graphs, the maximum computable diameter of GG^* was around up to around 8. We introduce a new characterization of graph structure, "branch-height," based on which an MILP formulation and a graph search algorithm are designed for chemical acyclic graphs. The results of computational experiments using properties such as octanol/water partition coefficient, boiling point and heat of combustion suggest that the proposed method can infer chemical acyclic graphs GG^* with n=50n=50 and diameter 30

    二部分構造を持つ順序付け問題に対する近似方式

    Get PDF
    京都大学0048新制・課程博士博士(情報学)甲第18621号情博第545号新制||情||96(附属図書館)31521京都大学大学院情報学研究科数理工学専攻(主査)教授 永持 仁, 教授 太田 快人, 教授 髙橋 豊学位規則第4条第1項該当Doctor of InformaticsKyoto UniversityDFA

    λ-Group Strategy-Proof Mechanisms for the Obnoxious Facility Game in Star Networks

    No full text
    corecore